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I Glossary 

• Chaos: Chaos occurs in dynamical systems with two ingredients: (i) nonlinear recurrent re-injection 
of the dynamics into a finite domain in phase space and (ii) exponential sensitivity of the trajectories in 
phase space to initial conditions. 
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Continuous phase transitions: If there is a finite discontinuity in the first derivative of the thermody- 
namic potential, then the phase transition is termed first-order. During such a transition, a system either 
absorbs or releases a fixed amount of latent heat (e.g. the freezing/melting of water/ice). If the first 
derivative is continuous but higher derivatives are discontinuous or infinite, then the phase transition 
is called continuous, of the second kind, or critical. Examples include the critical point of the liquid- 



gas transition, the Curie point of the ferromagnetic transition, or the superfluid transition | Landau and 
\Lifshitz\ [T9801 \Yeomans\ [T992) . 

• Critical exponents: Near the critical point, various thermodynamic quantities diverge as power laws 
with associated critical exponents. In equilibrium systems, there are scaling relations that connect some 
of the critical exponents of different thermodynamic quantities [ Cardy] |1996 Landau and Lifshltz\ 
1980} \Sornetie\ [2004} ^tan!ey\ [1999] \Yeomans\ [1992) . 



• Critical phenomena: Phenomena observed in systems that undergo a continuous phase transition. 
They are characterized by scale invariance: the statistical properties of a system on one scale are 
related to those at another scale only through the ratio of the two scales and not through anyone of the 
two scales individually. The scale invariance is a result of fluctuations and correlations at all scales, 



which prevents the system from being separable in the large scale limit at the critical point \Cardy 
[19961 \Somette\ [20041 \Ymhians\ | 1 992| . 

• Declustering: In studies of seismicity, declustering traditionally refers to the deterministic identifi- 
cation or fore-, main- and aftershocks in sequences (or clusters) of earthquakes clustered in time and 
space. Recent, more sophisticated techniques, e.g. stochastic declustering, assign to earthquakes prob- 
abilities of being triggered or spontaneous. 

• Dynamical scaling and exponents: Non-equilibrium critical phase transitions are also characterized 
by scale invariance, scaling functions and critical exponents. Furthermore, some evidence supports 
the claim that universality classes also exist for non-equilibrium phase transitions (e.g. the directed 
percolation and the Manna universality class in sandpile models), although a complete classification 
of classes is lacking and may in fact not exist at all. Much interest has recently focused on directed 
percolation, which, as the most common universality class of absorbing state phase transitions, is ex- 
pected to occur in many physical, chemical and biological systems \Hinrichsen 2000 Luebeck 2004 
\Sornette\ [20041 . 

• Finite size scaling: If a thermodynamic or other quantity is investigated at the critical point under a 
change of the system size, the scaling behavior of the quantity with respect to the system size is known 
as finite size scaling | Cardy 1996| . The quantity may refer to a thermodynamic quantity such as the 



free energy or it may refer to an entire probability distribution function. At criticality, the sole length 
scale in a finite system is the upper cut-off s c , which diverges in the thermodynamic limit L — ► oo. 
Assuming a lower cut-off so <^ s c s, a finite size scaling ansatz for the distribution P(s; s c ) of the 
observable variable s, which depends on the upper cut-off s c is then given by: 

P(s;s c ) = as~ T G(s/s c ) for s,s c >s (1) 

where the parameter a is a non-universal metric factor, t is a universal (critical) exponent, and G is 



a universal scaling function that decays sufficiently fast for s>s c | Cardy 1996 Christensen et al. 



2007) . Pruessner |2004| provides a simple yet instructive and concise introduction to scaling theory 



and how to find associated exponents. System-specific corrections appear to sub-leading order. 
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• Fractal: A deterministic or stochastic mathematical object that is defined by its exact or statistical self- 
similarity at all scales. Informally, it often refers to a rough or fragmented geometrical shape which 
can be subdivided into parts which look approximately the same as the original shape. A fractal is 
too irregular to be described by Euclidean geometry and has a fractal dimension that is larger than its 
topological dimension but less than the dimension of the space it occupies. 

• Mean-Field: An effective or average interaction field designed to approximately replace the interac- 
tions from many bodies by one effective interaction which is constant in time and space, neglecting 
fluctuations. 

• Mechanisms for power laws: Power laws may be the hallmark of critical phenomena, but there are 
a host of other mechanisms that can lead to power laws (see Chapter 14 of \Sorneite\ |2004[ for a list 



of power law mechanisms as well as Mitzenmacher [2004); Clauset et al. | 2007| ). Observations of 
scale invariant statistics therefore do not necessarily imply SOC, of course. Power laws express the 
existence of a symmetry (scale invariance) and there are many mechanisms by which a symmetry can 
be obtained or restored. 

• Non-equilibrium phase transitions: In contrast to systems at equilibrium, non-equilibrium phase 
transitions involve dynamics, energy input and dissipation. Detailed balance is violated and no known 
equivalent of the partition function exists, from which all thermodynamic quantities of interest derive in 
equilibrium. Examples of non-equilibrium phase transitions include absorbing state phase transitions, 
reaction-diffusion models, morphological transitions of growing surfaces, and percolation in porous 
media \Hinrichsen\ [20051 \Luebeck\ [20041 . 

• Phase transitions: In (equilibrium) statistical mechanics, a phase transition occurs when there is a 
singularity in the free energy or one of its derivatives. Examples include the freezing of water, the 
transition from ferromagnetic to paramagnetic behavior in magnets, and the transition from a normal 
conductor to a superconductor [Landau and Lifshitz 1980, Yeomans 1992) . 



Renormalization group theory: A mathematical theory built on the idea that the critical point can be 
mapped onto a fixed point of a suitably chosen transformation on the system's Hamiltonian. It provides 
a foundation for understanding scaling and universality and provides tools for calculating exponents 
and scaling functions. Renormalization group theory provides the basis for our understanding of criti- 
cal phenomena [ Cardy 1996, Stanley 1999; Yeomans 1992) . It has been extended to non-Hamiltonian 
systems and provides a general framework for constructing theories of the macro-world from the mi- 
croscopic description. 



Self-Organized Criticality (SOC): Despite two decades of research since its inception by \Bak et al. 
1 1987| and the ambitious claim by Bak | 1996] that, as a mechanism for the ubiquitous power laws 



in Nature, SOC was "How Nature Works", a commonly accepted definition along with necessary and 
sufficient conditions for SOC is still lacking \Jensen 1998 Pruessner 2004[ Sornette 2004). A less 
rigorous definition may be the following: Self-organized criticality refers to a non-equilibrium, critical 
and marginally stable steady-state, which is attained spontaneously and without (explicit) tuning of 
parameters. It is characterized by power law event distributions and fractal geometry (in some cases) 
and may be expected in slowly driven, interaction-dominated threshold systems \Jensen 1998) . Some 
authors additionally require that temporal and/or spatial correlations decay algebraically (e.g. \Her- 



garten 2002 1, but see Pruessner | 2004| ). Definitions in the literature range from broad (simply the 



absence of characteristic length scales in non-equilibrium systems) to narrow (the criticality is due to 
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an underlying continuous phase transition with all of its expected properties) (see, e.g., Peters and 



Neelin 1 2006 1 for evidence that precipitation is an instance of the latter definition of SOC in which a 



non-linear feedback of the order parameter on the control parameter turns a critical phase transition 



into a self-organized one attracting the dynamics \ Sornette 1992 1). 



• Spinodal Decomposition: In contrast to the slow process of phase separation via nucleation and slow 
growth of a new phase in a material inside the unstable region near a first-order phase transition, spin- 
odal decomposition is a non-equilibrium, rapid and critical-like dynamical process of phase separation 
that occurs quickly and throughout the material. It needs to be induced by rapidly quenching the 
material to reach a sub-area (sometimes a line) of the unstable region of the phase diagram which is 
characterized by a negative derivative of the free energy. 

• Statistical physics is the set of concepts and mathematical techniques allowing one to derive the large- 
scale laws of a physical system from the specification of the relevant microscopic elements and of their 
interactions. 

• Turbulence: In fluid mechanics, turbulence refers to a regime in which the dynamics of the flow in- 
volves many interacting degrees of freedoms, and is very complex with intermittent velocity bursts 
leading to anomalous scaling laws describing the energy transfer from injection at large scales to dissi- 
pation at small scales. 

• Universality: In systems with little or no frozen disorder, equilibrium continuous phase transitions fall 
into a small set of universality classes that are characterized by the same critical exponents and certain 
scaling functions become identical near the critical point. The class depends only on the dimension of 
the space and the dimension of the order parameter. For instance, the critical point of the liquid-gas 
transition falls into the same universality class as the 3D Ising model. Even some phase transitions 
occurring in high-energy physics are expected to belong to the Ising class. Universality justifies the 
development and study of extremely simplified models (caricatures) of Nature, since the behavior of 
the system at the critical point can nevertheless be captured (in some cases exactly). However, non- 
universal features remain even at the critical point but are less important, e.g. amplitudes of fluctuations 



or system-specific corrections to scaling that appear at sub-leading order | Cardy 1996 Stanley 1999 
\Yeomans\ [19921 \Zee\ |2003) . 



II Definition and Importance of the Subject 

A fundamental challenge in many scientific disciplines concerns upscaling, that is, of determining the regu- 
larities and laws of evolution at some large scale from those known at a lower scale: biology (from molecules 
to cells, from cells to organs); neurobiology (from neurons to brain function), psychology (from brain to emo- 
tions, from evolution to understanding), ecology (from species to the global web of ecological interactions), 
condensed matter physics (from atoms and molecules to organized phases such as solid, liquid, gas, and in- 
termediate structures), social sciences (from individual humans to social groups and to society), economics 
(from producers and consumers to the whole economy), finance (from investors to the global financial mar- 
kets), Internet (from e-pages to the world wide web 2.0), semantics (from letters and words to sentences and 
meaning), and so on. Earthquake physics is no exception, with the challenge of understanding the transition 
from the laboratory scale (or even the microscopic and atomic scale) to the scale of fault networks and large 
earthquakes. 
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In this context, statistical physics has had a remarkably successful track record in addressing the upscal- 
ing problem in physics. While the macroscopic laws of thermodynamics have been established in the 19th 
century, their microscopic underpinning were elaborated in the early 20th century by Boltzmann and fol- 
lowers, building the magnificent edifice of statistical physics. Statistical physics can be defined as the set of 
concepts and mathematical techniques allowing one to derive the large-scale laws of a physical system from 
the specification of the relevant microscopic elements and of their interactions. Dealing with huge ensembles 
of elements (atoms, molecules) of the order of the Avogadro number (~ 6 • 10 23 ), statistical physics uses 
the mathematical tools of probability theory combined with other relevant fields of physics to calculate the 
macroscopic properties of large populations. 

One of the greatest achievement of statistical physics was the development of the renormalization group 
analysis, to construct a theory of interacting fields and of critical phase transitions. The renormalization group 
is a perfect example of how statistical physics addresses the micro-macro upscaling problem. It decomposes 
a problem of finding the macroscopic behavior of a large number of interacting parts into a succession of 
simpler problems with a decreasing number of interacting parts, whose effective properties vary with the 
scale of observation. The renormalization group thus follows the proverb "divide to conquer" by organizing 
the description of a system scale-by-scale. It is particularly adapted to critical phenomena and to systems 
close to being scale-invariant. The renormalization group translates into mathematical language the concept 
that the overall behavior of a system is the aggregation of an ensemble of arbitrarily defined sub-systems, 
with each sub-system defined by the aggregation of sub-subsystems, and so on \Sornette, 2004]. 

It is important to stress that the term "statistical" in "statistical physics" has a very different meaning, up 
to now, than in "statistical seismology," a field that has developed as a marriage between probability theory, 



statistics and the part of seismology concerned with empirical patterns of earthquake occurrences | Vere-Jones 



2006| (but not with physics). Statistical seismology uses stochastic models of seismicity, which are already 
effective large-scale representation of the dynamical organization. In contrast, a statistical physics approach 
to earthquake strives to derive these statistical models or other descriptions from the knowledge of the mi- 
croscopic laws of friction, damage, rupture, rock-water interations, mechano-chemistry and so on, at the 
microscopic scales \Sornette 1999, 2000 1. In other words, what is often missing in statistical seismology is 



the physics to underpin the stochastic model on physically-based laws, e.g. rate-and-state friction \ Dieterich 
[T9941 

The previously mentioned successes of statistical physics promote the hope that a similar program can 
be developed for other fields, including seismology. The successes have been more limited, due to the much 
more complex interplay between mechanisms, interactions and scales found in these out-of-equilibrium sys- 
tems. This short essay provides a subjective entry to understand some of the different attempts, underlining 
the few successes, the problems and open questions. Rather than providing an exhaustive review, we mention 
what we believe to be important topics and have especially included recent work. 



Ill Introduction 



Much of the recent interest of the statistical physics community has focused on applying scaling techniques, 
which are common tools in the study of critical phenomena, to the statistics of inter-event recurrence times 
or waiting times §Bak et al. 2002 Corral 2003 2004a b 2005 a]b) Corral and Christensen 2006[ Davidsen 



and Goltz 2004 Livina et al. 2006| . However, the debate over the relevance of critical phenomena to 



earthquakes stretches back as far as 30 years | Vere-Jones \911\\Allegre et al\\\9%2\\Smalley et q/.||1985 



Kagan 1989 ; Sornette and Sornette 1989 Bale and Tang 1989 [ ,\Sornette a nd Sornette 1999; Kagan., 1992a 
Olami et al, 1 1 992| |ATflgara| 1 1 994a[ Sornette and Sammis l995\\Saleur et a/.||1996a||Z?qfc||1996||.Fis/ier et al. 



1997| \Dahmen et al.\ [19981 \Jensen\ [T9981 \Nature Debates] [19991 \Hergarten] [2002) \5U?n7tie] [2002] [20041 



Kagan 2006 ; Dahmen and Ben-Zion , 2008 ; Zoeller et at. , 2008 1. The current debate on recurrence statistics 



is thus the latest tack in an evolving string of arguments with a long history. As discussed below, many of the 
claims made in the recent articles on recurrence statistics have either been challenged, refuted or explained 
by previously known facts about earthquake statisti cs \Lindman et al. 2005 2006| Molchan 2005 Saichev 



\and Sornette\ [2006b 2007, Wern er and Sornelie] |2007 1. As will be discussed below, this debate in the 
literature is important because of the potential consequences for understanding earthquakes, but it needs to 
be pursued with rigorous scientific arguments accessible to both the seismological and the statistical physics 
communities. 

The debate would almost certainly benefit significantly from testing hypotheses with simulations to estab- 
lish null hypotheses and benchmarks: seismicity patterns are sufficiently stochastic and earthquake catalogs 
contain a sufficient amount of observational uncertainties so as to make inference difficult. It is often not 
straightforward to predict the signal of well-known statistical features such as clustering in new data anal- 
ysis techniques. Therefore, testing the purported claims by realistic simulations of earthquake catalogs can 
provide a strong benchmark against which the claims can be evaluated. This view and the corresponding 
criticism of many studies has been put forward and defended for a long time by Kagan [ 1999]. 

Such a model-dependent approach may be at odds with the philosophy of a so-called "model-free" analy- 
sis, which the community of statistical physicists claim to take in their analysis. For instance, network theory- 
based approaches, space-time window-based finite size scaling, box-covering methods and other techniques 
used in the study of critical and fractal phenomena are said to be "model-free" because no assumptions about 
seismicity are supposedly made at the outset. By using model-free analysis techniques, the often uncertain 
and sometimes clearly wrong assumptions of flawed models and resulting biased results are meant to be 
circumvented. 

However, as is almost always the case in statistical hypothesis testing, the less assumptions are made about 
the test, the less powerful the test statistic. More importantly, seismicity is sufficiently stochastic so that well- 
known features may appear as novel in new analysis methods. Furthermore, to convince the seismological 
community of new data analysis techniques, the methods need to be tested on established knowledge and 
show the improvement over traditional methods. These types of initial tests are rarely performed by the 
statistical physics community. 

In the next section IV, we present a summary of some of the concepts and calculational tools that have 
been developed in attempts to apply statistical physics approaches to seismology. Then, section V summarizes 
the leading theoretical physical models of the space-time organization of earthquakes. Section VI presents 
a general discussion and several examples of the new metrics proposed by statistical physicists, underlining 
their strengths and weaknesses. Section VII briefly outlines future directions. 



IV Concepts and Calculational Tools 

IV. 1 Renormalization, Scaling and the Role of Small Earthquakes in Models of Trig- 
gered Seismicity 

A common theme in many of the empirical relations in seismology (and in those employed in seismicity 
models) is the lack of a dominating scale. Many natural phenomena can be approached by the traditional 
reductionist approach to isolate a process at a particular scale. For example, the waves of an ocean can be 
described quite accurately by a theory that entirely ignores the fact that the liquid is made out of individual 
molecules. Indeed, the success of most practical theories in physics depends on isolating a scale | |Wi7jfow| 
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1979|, although since this recognition, much progress has been made in developing a holistic approach for 



processes that do not fall into this class. Given current observational evidence, earthquakes seem to belong to 
the set of processes characterized by a lack of one dominating length scale: fluctuations of many or perhaps 
a wide continuum of sizes seem to be important and are in no way diminished - even when interested solely 
in large-scale descriptions \ Sornette and Pisa renko 2003). 

The traditional reductionist approach in seismology, which, for instance, attempted to separate large 
(main) shocks from small (fore- or after-) shocks, is slowly giving way to the holistic approach, in which 
all earthquakes are created equal and seismicity is characterized by fluctuations of all sizes. This gradual 
shift is supported, on a conceptual and qualitative level, by the vision of critical phenomena. A particularly 
strong model of the interactions between earthquakes has emerged in the concept of triggering, which places 
all earthquakes on the same footing: each earthquake can trigger its own events, which in turn can trigger 
their own events, and so on, according to the same probability distributions, and the resulting seismicity can 



be viewed as the superposed cascades of triggered earthquakes that cluster in space and time \Kagan and 
Knopoff\ [T98T] \Ogata\ [1988] [19981 \Hdmstetter and Sornette\ [2002) . 



From this point of view, it is natural that small earthquakes are important to the overall spatio-temporal 
patterns of seismicity. Indeed, the scaling of aftershock productivity with mainshock magnitude suggests that 
small earthquakes are cumulatively as important for the triggered seismicity budget as rarer but larger events 
\Felzer et al. 2002 Helmstetter 2003UHelmstetter et al. 2005) . The importance of small earthquakes has 
also been documented in, e.g., | Hanks\\\992 \Michael and 7oHes||1998||Marscm||2005| . 

But earthquake catalogs do not contain information (by definition) about the smallest, unobserved events, 
which we know to exist from acoustic emission experiments and earthquakes recorded in mines. To guarantee 
a finite seismicity budget, \Sornette and W erner 1 2005a | argued for the existence of a smallest triggering earth- 
quake, akin to a "ultra-violet cut-off in quantum field theory, below which earthquakes do not trigger other 
events. Introducing a formalism which distinguishes between the detection threshold and the smallest trigger- 
ing earthquake, they placed constraints on its size by using a simplified version of the popular Epidemic-Type 
Aftershock Sequence (ETAS) Model [Ogata 1988) , a powerful model of triggered seismicity based on em- 
pirical statistics, and by using observed aftershock sequences. Sornette and Werner | 2005b | considered the 
branching structure of one complete cascade of triggered events, deriving an apparent branching ratio and 
the apparent number of untriggered events, which are observed when only the structure above the detection 
threshold is known. As a result of our inability to observe the entire branching structure, inferred clustering 
parameters are significantly biased and difficult to interpret in geophysical terms. Second, separating trig- 
gered from untriggered events, commonly known as declustering, also strongly depends on the threshold, so 
that it cannot even in theory constitute a physically sound method. 

Sornette and Werner 1 2005b | also found that a simplified, averaged version of the ETAS model can be 



renormalized onto itself, with effective clustering parameters, under a change of the threshold. Saich ev and] 



Sornette [2006a] confirmed these results for the stochastic number statistics of the model using a rigorous 



approach in terms of generating probability functions, but also showed that the temporal statistics could not 
be renormalized. Furthermore, it can be shown (see Chapter 4 of Werner |2007 1) that the conditional intensity 
function of the ETAS model, the mathematical object which uniquely defines the model, cannot be renormal- 
ized onto itself under a change of magnitude threshold. It is not a fixed-point of the renormalization process 
operating via magnitude coarse-graining. The functional form of the model must change under a change in 
the detection threshold | Saichev and Sornette 2006a |. In other words, if earthquakes occur according to an 
ETAS model above some cut-off mo, then earthquakes above cannot be described by the ETAS model in 
a mathematically exact way. In practice, the ETAS model provides an excellent fit. The issue of how to deal 
with small earthquakes is thus reminiscent of the decades of efforts that have been invested in physics to deal 
with the famous ultra-violet cut-off problem, eventually solved by the so-called "renormalization" theory of 
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Feynmann, Schwinger and Tomonaga. In the 1960s and 1970s, this method of renormalization was extended 
into the "renormalization group" (in fact a semi-group in the strict mathematical sense) for the theory of crit- 
ical phenomena (see glossary), which we also mention in section V. It is fair to say that there has been limited 
success in developing a multi-scale description of the physics of earthquakes and, in particular, in addressing 
the upscaling problem and the impact of the many small earthquakes 

One tantalizing approach, not yet really understood in terms of all its consequences and predictions, 



is the variant of the ETAS model proposed by \Vere-Jones 1 2005 ], which has the remarkable property of 
being bi-scale invariant under a transformation involving time and magnitudes. One of the modifications 
brought in by \Vere-J ones |2005| is to assume that the distribution of the daughter magnitudes is dependent 
on the mother magnitude through a modification of the Gutenberg-Richter distribution of triggered earth- 
quake magnitudes by a term of the form exp(— 8\m — TOj|), where 6 > quantifies the distance to the 
standard Gutenberg-Richter distribution. Remarkably, \Saichev and Sorn ette |2005|, who studied the Vere- 
Jones model, found that, due to the superposition of the many magnitude distributions of each earthquake 
in the cascades of triggered events, the resulting distribution of magnitudes over a stationary catalog is a 
pure Gutenberg-Richter law. Thus, there might be hidden characteristic scales in the physics of triggering 
that are not revealed by the standard observable one-point statistical distributions. Simulation and parameter 
estimation algorithms for the Vere-Jones model are not yet available. If and when these algorithms become 
available, the study of this bi-scale invariant branching model may be a strong alternative to the ETAS model, 
as this model is exactly scale invariant with no ultra-violet nor infra-red cut-offs. 

Nevertheless, being empirically based, these stochastic point process models lack a genuine microscopic 
physical foundation. The underlying physics is not explicitly addressed and only captured effectively by 
empirical statistics, even at the smallest scales. The physical processes and their renormalization are missing 
in this approach. 



IV.2 Universality 

Universality, as defined in the glossary, justifies the development and study of extremely simplified models 
(caricatures) of Nature, since the behavior of a studied system at the critical point can nevertheless be cap- 
tured by toy models (in some cases exactly). For instance, the liquid-gas transition, the ferromagnetic to 
paramagnetic transition, and the behavior of binary alloys, all apparently different systems, can be described 
successfully by an extremely simplified picture of Nature (the Ising model) because of universality. The 
hope that a similar principle holds for earthquakes (and other non-equilbrium systems) underpins many of 
the models and tools inspired by statistical physics that have been applied to seismicity, to bring about the 
"much coveted revolution beyond reductionism" [Barabdsi et al. 1999| . 

Speaking loosely, the appearance of power laws in many toy models is often interpreted as a kind of 
universal behavior. The strict interpretation of universality classes, however, requires that critical exponents 
along with scaling functions are identical for different systems. It is interesting to note that slight changes in 
the sand-pile model already induce new universality classes, so that even within a group of toy models, the 
promise of universality is not, strictly, fulfilled | Bak et al. 1987 Kadanoff et al. , 1989 ; Manna , 1991 1. 

A lively debate in seismology concerns the universality of, on one hand, the frequency-size distribution 
of earthquake magnitudes (e.g. \Kagan 1993| Wesnousky 1994 Fisher et al.\\l997\\Dahmen et al. 1998 



\Kagan\ |1999| \Jackson and Kagan 2006 1, and, on the other hand, the universality of the exponent of the 



Gutenberg-Richter distribution (e.g. \Sornette et al. 1991 Bird and Kagan 2004 Wiemer and Katsumata 



\\999\ Schorle mmer et a/.||2005) ). A spatio-temporally varying critical exponent is not traditionally part of 
the standard critical phenomena repertoire, although analytical and numerical results based on a simple earth- 
quake model on a fault showed a possible spontaneous switching between Gutenberg-Richter and characteris- 



8 



tic earthquake behavior associated with a non-equilibrium phase transition | Fisher et al. 1997 Dahmen etal. 

Zoeller et al. 2008) . Another possible mecha- 



1998 Ben-Zion et al. 1999 Dahmen and Ben-Zion 2008 



nism for the coexistence of and intermittent shifts between different regimes (Gutenberg-Richter scaling and 
characteristic earthquakes) stems from the competition between several interacting faults which give rise to 



long intervals of activity in some regions followed by similarly long intervals of quiescence \Sornette et al. 



1994 1995 Lee et al. 1999] , Both careful empirical investigations of seismicity parameters and theoretical 
progress on heterogeneous, spatially extended critical phenomena may help elucidate the controversy. 



IV.3 Intermittent Periodicity and Chaos 



As part of the conquest of chaos theory in the 70s and 80s [May 1976| \Eckman\ 1981 1, its concepts and 
methods were invariably also applied to seismicity. Huang and Turcotte 1 1990a b] modeled the interaction 
between two faults by two sliding blocks that are driven by a plate through springs, coupled to each other via 
another spring and endowed with a velocity-weakening friction law. The dynamical evolution of the blocks 
showed chaotic behavior and period-doubling (the Feigenbaum route to chaos \Eckman 1981]). Huang and 
\Turcotte\ | |1990a|b[ suggested that the interaction of the Parkfield segment with the southern San Andreas 
fault may be governed by the kind of chaotic behavior they observed in their model: the lighter block slipped 
quasi-periodically for several times until both slipped together. Their study explained that apparent quasi- 
periodicity of earthquakes on fault segments may be a result of chaotic interactions between many fault 
segments, thereby providing a warning that the extrapolations of quasi-periodic models are not to be trusted 
(e.g. {Harris and Ramon 2006; Bakun ei al. | [2005 ] and references therein). However, it is doubtful that 
models with just a few degrees of freedom can go a long way towards providing deeper physical insights, or 
predictive tools. One needs to turn to models with a large number of degrees of freedom, for which turbulence 
appears as the leading paradigm of complexity. 



IV.4 Turbulence 



A drastically different approach has been favored by Yan Kagan \ Kagan] [1989] |1992a| |1994a| [2006| , who 
described seismicity as the "turbulence of solids" - attesting to the potentially far more complex problems 
that need to be solved than the theory of critical phenomena promises to deliver. While renormalization 



group methods and scaling theory have contributed immensely to the study of turbulence \ Frisch 1995 1, the 
problem of turbulence involves significant additional complications. First, loosely speaking, renormalization 
group theory helps predict global behavior by coarse-graining over degrees of freedom, which is essentially 
a bottom-up approach. In turbulence, the enstrophy acts bottom-up, but the energy cascades top to bottom. 
Secondly, there is a significant spatial and topological aspect to turbulence, for instance involving topological 
defects, such as filament structures, which are crucial to the dynamical evolution of the system. The exis- 
tence of the two cascades (top-down and bottom-up) as well as the influence of the dissipation scale all the 
way within the so-called inertial range makes turbulence the most important classical problem still unsolved 
in Physics. The importance of addressing the issue of the interplay between the top-down and bottom-up 



cascades in earthquake toy models has been outlined by Gabrielov et al. | 2000a|b| ; Zaliapin et a/.|p0~03a|b| 



The analogous problem for seismicity lies in the complex fault network, which constrains seismicity 
through its weak structures but also grows and evolves because of earthquakes redistributing stresses and 
rupturing fresh surfaces. The statistical description of this tensorial and dynamical problem is only at its 



beginning \Kagan and Knopoff 1980[|A'agcm||1981a|b 


Kagan and Knopoff , 1985a b Kagan 1987a|b|fl988 


1992b Miltenberger et al. 1993 Sornette et al. 1994 


Gabrielov et al. 1996). But it is likely to be a key 



aspect to the dynamical evolution of faults and seismicity. New physics and approaches are required to tackle 



9 



the tensorial nature of the stress and strain fields and the complex topological structures of defects, from 
dislocations to joints and faults, as well as the many different physical processes operating from the atomic 
scale to the macro-scale \Sornette\ [l 999[ |2000| . 



IV.5 Self- Organized Criticality 



Self-organized criticality (SOC) refers to the spontaneous organization of a system driven from outside into 
a globally dynamical statistical stationary state, which is characterized by self-similar distributions of event 
sizes and fractal geometrical properties. SOC applies to the class of phenomena occurring in driven out- 
of-equilibrium systems made of many interactive components, which possess the following fundamental 
properties : 1) a highly non-linear behavior, 2) a very slow driving rate , 3) a globally stationary regime, 
characterized by stationary statistical properties, and 4) power-law distributions of event sizes and fractal 



geometrical properties. The crust obeys these four conditions, as first suggested by \Sornette and Sornette 
1 1989| and Bak and Tang 1 1989], who proposed to understand the spatio-temporal complexity of earthquakes 
in this perspective. 

The appeal of placing the study of earthquakes in the framework of critical phenomena may be summa- 
rized as follows. Power law distributions can be understood as a result of an underlying continuous phase 
transition into which the crust has organized itself | Sornette et al. 1990 Sornette 1991 1. Applying the meth- 
ods of re normalization group theory may help calculate exponents and scaling functions and rationalize the 



spatio-temporal organization of seismicity along with its highly correlated structures. For instance, Sornette 
and Virieux 1 1992 1 provided a theoretical framework which links the scaling laws exhibited by earthquakes 
at short times and plate tectonic deformations at large times. Perhaps earthquakes fall into a universality 
class which can be solved exactly and/or investigated in toy models. Moreover, studying the detailed and 
highly complicated microphysics involved in earthquakes may not lead to insights about the spatio-temporal 
organization, because, as a critical phenomenon, the traditional approach of separating length scales to de- 
scribe systems is inadequate. On the other hand, as mentioned above, there is the possibility of a hierarchy of 
physical processes and scales which are inter-related \Ouillon et al. 1995 , 1996 1, for which the simplifying 
approach in terms of critical phenomena is likely to be insufficient. 

As another reason for the importance of the topic, interesting consequences for the predictability of earth- 
quakes might be derived, for instance by mapping earthquakes to a genuine critical point (the accelerating 
moment release hypothesis, e.g. ^S ornette and Sammis] 1 1995 Sornette 2002[ ) or by mapping earthquakes 
to SOC (e.g. | Geller et al. 1997| Nature Debates] |1999| ). The latter mapping had led some to argue that 



earthquakes are inherently unpredictable. In the sandpile paradigm | Bak et al. 1987[ , there is little difference 
between small and large avalanches, and this led similarly to the concept that "large earthquakes are small 
earthquakes that did not stop," hence their supposed lack of predictability. More than ten years after this 
contentious proposal, a majority of researchers, including most of the authors of this "impossibility claim," 
recognize that there is some degree of predictability \ Jordan\ |2006 ; Helmstet ter et a/.||2006| . Actually, the 
clarifications came from investigators of SOC, who recognized that the long-term organization of sandpiles 



\Dhar 1990 1999] and of toy models of earthquakes and fault networks [Miltenberger et al. 1993| Sor- 



nette et al. 1994 1995 1 is characterized by long-range spatial and temporal correlations. Thus, large events 



may indeed be preceded by subtle long-range organizational structures, an idea at the basis of the acceler- 
ating moment release hypothesis. This idea is also underlying the pattern recognition method introduced 
by Gelfand et al.\ (1976 1 and developed extensively by V. Keilis-Borok and his collaborators for earthquake 



predictions ^Keilis-Borok et al. 2003] . In addition, Huang et al. 1 1998 1 showed that avalanche dynamics 
occurring within hierarchical geometric structures are characterized by significant precursory activity before 
large events; this provides a clear proof of the possible coexistence between critical-like precursors of large 
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events and a long-term self-organized critical dynamical state. 

In summary, self-organized criticality provides a general conceptual framework to articulate the search 
for a physical understanding of the large-scale and long-time statistical properties of the seismogenic process 
and of the predictability of earthquakes. Beyond this, it is of little help as many different mechanisms have 
been documented at the origin of SOC (see, e.g., chapter 15 in \ Sornette 2004) . SOC is not a theory, it 
does not provide any specific calculation tools; it is a concept offering a broad classification of the kinds of 
dynamics that certain systems, including the Earth crust, seem to spontaneously select. 



V Competing mechanisms and models 

It should be noted at this point that the statistical physics approach to earthquake science is not limited to 
SOC. Over the years, several groups have proposed to apply the concepts and tools of statistical physics to 
understanding the multiscale dynamics of earthquake and fault systems. Various mechanisms drawn concep- 
tually from statistical mechanics but not necessary even limited to critical (phase transition) phenomena have 
been proposed and are being pursued. Such approaches include the concept of the critical point earthquake 
related to accelerated moment release, network theory, percolation and fiber models as models for fracture, 
and many more, some of which can be found in \Hergarten 2002, Sornette\ |2004| \Turcotte\\\991\ \Turcotte\ 
\eTal\ 12000) . 

In this section, we outline some of the major model classes which underpin distinct views on what are the 
dominating mechanisms to understand earthquakes and their space-time organization. 



V.l Roots of complexity in seismicity: dynamics or heterogeneity? 



The 1990s were characterized by vigorous discussions at the frontier between seismology and statistical 
physics aimed at understanding the origin of the observed complexity of the spatio-temporal patterns of 
earthquakes. The debate was centered on the question of whether space-time complexity can occur on a 
single homogeneous fault, solely as a result of the nonlinear dynamics [Shaw |1993| |1994| |1995| |1997 



\Cochard and M adariaga,, |1994| |1996[ \Langer et al.\ |1996| \Ben-Zion and Rice 



slip and velocity dependent friction law validated empirically in particular by Dieterich | 1987 



19971 



associated with the 
1994|; 



and Rice 1993||1995 \Knopoff\ [T996) 



1992 



Dieterich and Kilgore | 1994). O r, is the presence of quenched heterogeneity necessary $Rice 1993 Ben-Zion 



The rediscovery of the multi-slider-block-spring model of Burridge and Knopoff [ 1964 1 by \Carlson and\ 
hanger | 1989) led to a flurry of investigations by physicists | Carlson et al.\\\994\\Rundle and Klein\ 1995 



nd 
>5; 



Longer et al. 1996], finding an enticing entry to this difficult field, in the hope of capturing the main empirical 



statistical properties of seismicity. It is now understood that complexity in the stress field, in co-seismic slips 
and in sequences of earthquakes can emerge purely from the nonlinear laws. However, heterogeneity is 
probably the most important factor dominating the multi-scale complex nature of earthquakes and faulting 
\Ouillon et al. 1 995| 1 1 996} |5c/;ofe| 1 1 99 1 1 12002[ . It is also known to control the appearance of self-organized 
critical behavior in a class of models relevant to the crust \Sorne tte et q/.[ [l995 ; Shnirman and Blanter 1998 1. 



V.2 Critical earthquakes 

This section gives a brief history of the "critical earthquake" concept. 

We trace the ancestor of the critical earthquake concept to Vere- Jones | 1977), who us ed a branching model 
to illustrate that rupture can proceed through a cascade of damage events. Allegre et al. 1 1982] proposed what 
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is in essence a percolation model of damage/rupture describing the state of the crust before an earthquake. 
They formulated the model using the language of real-space renormalization group, in order to emphasize the 
multi-scale nature of the underlying physics, and the incipient rupture as the approach to a critical percolation 
point. Their approach is actually a reformulation in the language of earthquakes of the real-space renormal- 
ization group approach to percolation developed by Reynolds et al. | 1977 1. Chelidze | 1982 1 independently 
developed similar ideas. In the same spirit, Smalley et al. | 1985| pro posed a renormalization group treatment 
of a multi-slider-block-spring model. \Sorne tte and Sornette |1999| took seriously the concept put forward 
by \Allegre et a/.|p982) and proposed to test it empirically by searching for the predicted critical precursors. 
|Vb;g/if| | |1988||1989[ was probably the first author to introduce the idea of a time-to-failure analysis quantified 
by a second order nonlinear ordinary differential equation. For certain values of the parameters, the solution 
of \Voight 1 19881 [1989)' s time-to-failure equation takes the form of a finite time singularity (see Sammis 



and Sornette |2002| for a review and Sornette and Helmstetter [ 2002 1 for a mechanism based on the ETAS 
model). He proposed and did use it later to predict volcanic eruptions. The concept that earthquakes are 
somehow associated with critical phenomena was also underlying the research efforts of a part of the russian 
school \Keilis-Borok , 1990, Tumarkin and Shnirman 1992]. 

The empirical seed for the critical earthquake concept was the repeated observations that large earthquakes 
have been preceded by an increase in the number of intermediate size events \Keilis-Borok and Malinovskaya 



\96A\Mogi 


1969; Lindh 1990[ Ellsworth et al. 


1981 \\Raleigh et al] 


\m2\\Keilis-Borok et a/.||1988| Sykes 


and Jaume\\\99Q\\Jones 1994 Brehm and Braile 


l998\\Jaume and Sykes\ 1999|. The relation between these 



intermediate events and the subsequent main event took a long time to be recognized because the precursory 
events occur over such a large area. |5yfces and Jau me 1 199 0) proposed a s pecific law ~ exp[t/Y] quantifying 
the acceleration of seismicity prior to large earthquakes. \Bufe and Var nes [ 1993 1 proposed that the finite-time 
singularity law 

^Benioff ~ l/(*c — 1)™ (2) 

is a better empirical model than the exponential law. In (|2jl, eBeniof f is me cumulative Benioff strain, t c is 
critical time of the occurrence of the target earthquake and to is a positive exponent. The fit with this law 
of the empirical Benioff strain calculated by summing the contribution of earthquakes in a given space-time 
window is supposed to provide the time t c of the earthquake and thus constitutes a prediction. This expression 
Q was justified by a mechanical model of material damage. It is important to understand that the law |2]) can 
emerge as a consequence of a variety of mechanisms, as reviewed by Sammis and Sornette 1 2002 1 . 



One of these mechanisms has been coined the "critical earthquake" concept, first formulated by Sornette 



and Sammis [ 1995 1, who proposed to reinterpret the formula Q proposed by Bufe and Varnes 1 1993| and pre- 



vious related works by generalizing them within the statistical physics framework. This concept views a large 
earthquake as a genuine critical point. Using the insight of critical points in rupture phenomena, \Sornette and\ 
\Sammis\ (T995 1 proposed to enrich equation now interpreted as a kind of diverging susceptibility in the 
sense of critical phenomena, by considering complex exponents (i.e. log-periodic corrections to scaling). 
These structures accommodate the possible presence of a hierarchy of characteristic scales, coexisting with 
power laws expressing the scale invariance associated with a critical phenomenon 15omeffe|Fl998|. This was 



nette 



followed by several extensions | Saleur et al. 1996a Huang et al. 1998 Johansen et al. 1996 2000]. Sor- 



1 2002) r eviewed the concept of critical "ruptures" and earthquakes with application to prediction. Ide 



and Sornette | 2002 1 presented a simple dynamical mechanism to obtain finite-time singularities (in rupture 



in particular) decorated by complex exponents (log-periodicity). Bowman et al. | 1998 1; Ouillon and Sornette 



1 2000); Zoller et al. | 2001| ; Zoller and Hainzl [ 2002 1 ; Ouillon and Sornette [2004] proposed empirical tests 



of the critical earthquake concept. The early tests of \Bowman et al. | 1998] have been criticized by Hardebeck 



\et a/.||2"007) , while \Vere-Jones efa l. [2001 1 commented on the lack of a formal statistical basis of the accel- 
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erating moment release model. This stresses the need for rigorous tests in the spirit of \Rhoades and Evison 



2004 2005 1 . The debate is wide open, especially in view of the recent developments to improve the deter- 



mination of the relevant spatio-temporal domain that should be used to perform the analyses | Bowman and 
King 2001a|b King and Bowman , 2003 Levin et aL~\ 2006 1 (see \ Freund and Sornette 2007 1 for a review). 



V.3 Spinodal decomposition 

Klein, Rundle and their collaborators have suggested a mean-field approach to the multi-slider-block-spring 
model justified by the long-range nature of the elastic interactions between faults. This has led them to 
propose that the fluctuations of the strain and stress field associated with earthquakes are technically those 



occurring close to a spinodal line of an underlying first-order phase transition (see \Rundle and Klein 1993 
Klein and Rundle 1997 Rundle et al. \ |2003| and references therein). This conceptual view has inspired 



them to develop the "Pattern Informatics" technique, an empirical seismicity forecasting method based on 



the idea that changes in the seismicity rate are proxies for changes in the underlying stress | Tiampo et al. 
[2006] \Holliday etal.\ [2006| . 

The fluctuations associated with a spinodal line are very similar to those observed in critical phenomena. 
It is thus very difficult if not impossible in principle to falsify this hypothesis against the critical earthquake 
hypothesis, since both are expected to present similar if not identical signatures. Perhaps, the appeal of 
the spinodal decomposition proposal has to be found at the theoretical level, from the fact that first-order 
phase transitions are more generic and robust than critical phenomena, for systems where heterogeneity and 
quenched randomness are not too large. 



V.4 Dynamics, stress interaction and thermal fluctuation effects 



Fisher et al.\\\ 997| ; \Dahmen et al.\ fl99 8 1 ; Ben-Zion et al. | 1999 1 and co-workers (see the reviews by Dah 
men and Ben-Zion\ | |2008| and \Zoeller et al |2008|) have introduced a mean- field model (resulting from a 



uniform long-range Green function) of a single fault, whose dynamical organization is controlled by two 
control parameters, e which measures the dynamic stress weakening and c which is the deviation from stress 
conservation (due for instance to coupling with ductile layers). The point (e = 0; c = 0) is critical in the 
sense of a phase transition in statistical physics, with its associated scale invariant fluctuations described by 
power laws. Dynamic stress strengthening (e < 0) leads to truncated Gutenberg-Richter power laws. Dy- 
namic stress weakening (e > 0) is either associated with a truncated Gutenberg-Richter power law for c > 
or with characteristic earthquakes decorating a truncated power law for c < 0. The coexistence of a char- 
acteristic earthquake regime with a power-law regime is particularly interesting as it suggests that they are 
not exclusive properties but may characterize the same underlying physics under slightly different conditions. 



This could provide a step towards explaining the variety of empirical observations in seismology | Schwartz 
\and Copperilhlih] [1984] \Wesnousky\ [1994) \Kagan\ |l 994b[ \Aki\ [T995] . 



Sornette et al. |1995| have obtained similar conclusions using a quasi-static model in which faults grow 
and self-organize into optimal structures by repeated earthquakes. Depending on the value of dynamical 
stress drop (controlling the coupling strength between elements) relative to the amplitude of the frozen het- 
erogeneity of the stress thresholds controlling the earthquake nucleation on each fault segment, a characteris- 
tic earthquake regime with a truncated power law is found for small heterogeneity or large stress drop while 
the power law SOC regime is recovered by large heterogeneity or small stress drop. The two approaches 
of \Fisher et al.\ 1997[ \Dahmen ~e t al. , 1998, B en-Zion et al. \ |1999[ \Dah men and Ben-Zion, 2008] on the 
one hand and of Sornette et al. | 1995 1 on the other hand can be reconciled conceptually by noting that the 
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dynamic stress weakening of Dahmen and Ben-Zion | 1 2008 1 controls the dynamical generated stress hetero- 
geneity while the lack of stress conservation c controls the coupling strength. Fundamentally, the relevant 
control parameter is the degree of coupling between fault elements seen as threshold oscillators of relax- 
ation versus the variance of the disorder in their spontaneous large earthquake recurrence times. Generically, 
power law statistics are expected to co-exist with synchronized behavior in a general phase diagram in the 
heterogeneity-coupling strength plane | Sornette et al. 1995 Osorio et al. 2007) . 

Let us finally mention a promising but challenging theoretical approach, which has the ambition to bridge 
the small-scale physics controlled by thermal nucleation of rupture to the large-scale organization of earth- 
quakes and faults \ Ouillon and Sornette , 2005\ \Sornette and Ouillon\\20Q5j . Partial success has been obtained 
with a remarkable prediction on the ("multifractal") dependence of the Omori law exponent of aftershocks on 
the magnitude of the mainshock, verified by careful empirical analyses on earthquakes in California, Japan 
and worldwide \Ouillon et al. | 12007] . 



VI Empirical studies of seismicity inspired by statistical physics 

"False facts are highly injurious to the progress of science, for they often endure long; but false views, if supported by 
some evidence, do little harm, for everyone takes a salutary pleasure in proving their falseness." Charles Darwin, in The 
Origin of Man, Ch. 6. 



VI.l Early successes and latter subsequent challenges 

A significant benefit of the statistical physics approach to seismology has been the introduction of novel 
techniques to analyze the available empirical data sets, with the goal of obtaining new insights into the 
spatio-temporal organization of seismicity and of revealing novel regularities and laws that may guide the 
theoretical analysis. 

A prominent forerunner is the application of the concept of fractals introduced by Mandelbrot | 1982) and 
of the measures of fractal dimensions to describe complex sets of earthquake epicenters, hypocenters and 
fault patterns. The use of fractals has constituted an epistemologic breakthrough in many fields, and not only 
in seismology. Indeed, before Mandelbrot, when dealing with most complex systems, one used to say: "this is 
too complicated for a quantitative analysis" and only qualitative descriptions were offered. After Mandelbrot, 
one could hear: "this is a fractal, with a fractal dimension equal to xxx!" By providing a new geometrical way 
of thinking about complex systems associated with novel metrics, Mandelbrot and his fractals have extended 
considerably the reach of quantitative science to many complex systems in all fields. 

However, while there have been some attempts to use fractal dimensions as guidelines to infer the un- 
derlying organization processes, as for instance in ^Sornette et oT 1990a|b|, most of the init ial reports have 
lost their early appeal | Turcotte 1986; Scholz and Mandelbrot 1989 Barton and La Pointe 1995a b | since 
the complexity of seismicity and faulting is much too great to be captured by scaling laws embodying solely 
a simple scale invariance symmetry. Among others, multifractal and adapted wavelet tools are needed to 



quantify this complexity, see for instance | Geilikman et al. 1990 Ouillon et al. 1995 1996 Molchan and 



\Kronr od 2005 , ]Ouillon et a7T||2008| |. It should also be noted that few studies of the fractal dimensions of 
seismicity address the significant issues of errors, biases and incomplete records in earthquake catalogs - a 
notable exception being \Kagan 2007 1. 

Since the beginning of the 21st century, a renewal of interest and efforts have burgeoned as groups of 
statistical physicists, interested in earthquakes as a potential instance of self-organized criticality (SOC), 
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have claimed "novel", "universal" and "robust" scaling laws from their analysis of the spatio-temporal or- 
ganization of seismicity. The authors purport to have discovered universal and hitherto unknown features 
of earthquakes that give new insights into the dynamics of earthquakes and add to the evidence that earth- 
quakes are self-organized critical. We now discuss a few of these recent studies to illustrate the existence of 
potential problems in the "statistical physics" approach. In a nutshell, we show that perhaps most of these 
"novel scaling laws" can be explained entirely by already known statistical seismicity laws. This claim has 
been defended by other experts of statistical seismology, the most vocal being perhaps Yan Kagan at UCLA 
|gqgan||l999) . 

The flurry of interest from physicists comes from their fascination with the self-similar properties exhib- 
ited by seismicity (e.g. the Gutenberg-Richter power law of earthquake seismic moments, the Omori-Utsu 
law of the decay of aftershock rates after large earthquakes, the fractal and multifractal space-time organi- 
zation of earthquakes and faults, etc.), together with the development of novel concepts and techniques that 
may provide new insights. But, and this is our main criticism based on several detailed examples discussed 
below, many of the new approaches and results do not stand close scrutiny. This failure is rooted in two short- 
comings: (i) the lack of testing of new methods on synthetic catalogs generated by benchmark models which 
are based on well-known statistical laws of seismicity, and (ii) the failure to consider earthquake catalog bias, 
incompleteness and errors. The latter may cause catalog artifacts to appear as genuine characteristics of earth- 
quakes. Testing the results on a variety of catalogs and considering the influence of various catalog errors can 
help minimize their influence. The former short-coming often leads to the following scenario: authors fail to 
realize that a simpler null hypothesis could not be rejected, namely that their "discovery" could actually be 
explained by just a combination of basic statistical laws known to seismologists for decades. 

The well-established laws of statistical seismicity that authors should consider before claiming for novelty 
include the following: 

1. the Gutenberg-Richter law for the distribution of earthquake magnitudes with a &-value close to 1 
(corresponding to an exponent ~ 2/3 for the probability density function of seismic moments), 

2. the Omori-Utsu law for the decay of the rate of aftershocks following a mainshock, 

3. the inverse Omori law for foreshocks, 

4. the fact that aftershocks also trigger their own aftershocks and so on, and that aftershocks do not seem 
to exhibit any distinguishable physical properties, 

5. the fact that the distribution of distances between mainshocks and aftershocks has a power law tail, 

6. the fertility law (the fact that earthquakes of magnitude M trigger of the order of 10 aM aftershocks 
with a < b ~ 1, 

7. the fractal distribution of faults which are concentration centers for earthquakes. 

This above non-exhaustive list selects "laws" which are arguably non-redundant, in the sense that it is likely 
not possible to derive one of these laws from the others (a possible exception is the inverse Omori law for 
foreshock, which can be derived from the direct Omori law for aftershocks in the context of the ETAS model 
\Helmstetter et al. 2003 Helmstetter and Sornette 2003| ). Some experts would argue that we should add 



to this list other claimed regularities, such as "Bath's law" (see e.g. \Shcherbakov and Turcotte 2004 1 for 
a recent discussion emphasizing the importance of this law), that states that the differences in magnitudes 
between mainshocks and their largest aftershocks are approximately constant, independent of the magnitudes 
of mainshocks [ Ba th\ [19651 • However, Helmstetter a nd Sornet te [ 2003 1 ; \Saichev and Sornette] f2005 1 have 
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shown that Bath's law can be accurately recovered in ETAS-type models combining the first, second, fourth, 
and sixth laws stated above, with the assumption that any earthquake can trigger subsequent earthquakes. 



VI.2 Entropy method for the distribution of time intervals between mainshocks 



Mega et aT| | |2003) used the "diffusion entropy" method to argue for a power-law distribution of time intervals 
between a large earthquake (the mainshock of a seismic sequence or cluster) and the next one. Helmstetter 



and Sornette |2004| showed that all the "new" discoveries reported by Mega et al. | 2003| (including the 
supposedly new scaling) can be explained solely by Omori's law for intra-cluster times, without correlation 
between clusters, thus debunking the claim for novelty. 



VI.3 Scaling of the PDF of Waiting Times 



Bak et al. 1 2002 1 analyzed the scaling of the probability density function of waiting times between succes- 
sive earthquakes in southern California as a function of "box size" or small regions in which subsequent 
earthquakes are considered. They found an approximate collapse of the pdfs for different seismic moment 
thresholds S and box sizes L which suggested the following scaling ansatz for the waiting times T: 



T a P s , L ( T ) = f(TS- b L d f) 



(3) 



where b = 1 is the Gutenberg-Richter exponent, dt ~ 1.2 was claimed to be a spatial fractal dimension 
of seismicity (see Molchan and Kronrod |2005| and Kagan |2007| for more in-depth studies), a = 1 was 
identified as the exponent in the Omori law and /(•) is a scaling function which was proposed to be roughly 
constant up to a constant ("kink") beyond which it quickly decays. The scaling Q was claimed to be a unified 
law for earthquakes that revealed a novel feature in the spatio-temporal organization of seismicity in that the 
Gutenberg-Richter, the Omori law and the spatial distribution of earthquakes were unified into a single picture 
that made no distinction between fore-, main- and aftershocks. The scaling relations and critical exponents 
were claimed to be contained in the scaling ansat z. \Corral | 2003 2004a|b 2005a| and others broadened 
the analysis to other regions of the world. Corral | 2004a) proposed a slightly different scaling ansatz for a 
modified data analysis. 

Early criticism came from Lindman et al. [ 2005 ], who noted that generating synthetic data using a non- 



homogeneous Poisson process derived from Omori's law was able to reproduce some of the results of Bak 



etal. 1 2002 1 , indicating a rather trivial origin of the unified scaling law. Molchan | 2005 1 showed that, if at least 
two regions in the data set are independent, then, if a scaling relation were to hold exactly, this scaling function 
could only be exponential. All other functions could only result in approximate data collapses. Proponents 
of the unified scaling law, e.g. \Corral\ f2005b| , argued that indeed all regions were correlated, as expected 
in systems near a critical point so that the assumption of independence between different regions should not 
hold. But Molchan 1 2005 1 also showed that a simple Poisson cluster model (Poissonian mainshocks that 
trigger Omori-type aftershock sequences) could reproduce the short and long time limits of the observed 
statistics, indicating that the Omori law, the Gutenberg-Richter relationship and simple clustering were the 
sole ingredients necessary for the observed short and long time limit, and no spatial correlation was needed. 
Saichev and Sornette | 2006b 2007 1 extended Molchan's arguments to show that the approximate data 



collapse of the waiting times could be explained completely by the Epidemic-Type Aftershock Sequence 
(ETAS) model of Ogata |1988|. This provided further evidence that the apparent data collapse was only 
approximate. Remarkably, the theoretical predictions of the ETAS model seem to fit the observed data better 



than the phenomenological scaling function proposed by Corral 1 2004a | to fit the data. Saichev and Sornette 
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1 2006b 2007 1 thus showed that a benchmark model of seismicity was able to reproduce the apparent unified 
scaling law and that therefore the distribution of interevent times did not reveal new information beyond what 
was already known via statistical laws: The combination of the Gutenberg-Richter law, the Omori law, and 
the concept of clustering suffice to explain the apparent "universal" scaling of the waiting times. 



Sornette et al. | 2008] developed an efficient numerical scheme to solve accurately the set of nonlinear 



integral equations derived previously in \Saichev and Sornette 2007] and found a dramatic lack of power for 
the distribution of inter-event times to distinguish between quite different sets of parameters, casting doubt on 
the usefulness of this statistics for the specific purpose of identifying the clustering parameter (e.g. \Hainzl 
\etaT\ [20061 ). 



VI.4 Scaling of the PDF of Distances Between Subsequent Earthquakes 



Davidsen and Paczuski [2005] claimed evidence contradicting the theory of aftershock zone scaling in favor 



of scale-free statistics. Aftershock zone scaling refers to the scaling of the mainshock rupture length, along 



which most aftershocks occur, with the mainshock magnitude [Kagan. 2002 1. Davidsen and Paczuski (2005} 
suggested that the probability density function of spatial distances between successive earthquakes obeys 
finite size scaling with a novel dynamical scaling exponent, suggesting that the mainshock rupture length 
scale has no impact on the spatial distribution of aftershocks and that earthquakes are self-organized critical. 



\Werner and Sornette [ 2007 1 debunked this claim by showing that (i) the purported power law scaling 
function is not universal as it breaks down in other regions of the world; (ii) the results obtained by Davidsen 
\and Paczuski\ |2005[ for southern California depend crucially on a single earthquake (the June 28, 1992, 
M7.3 Landers earthquake): without Landers and its aftershocks, the power law disappears; (iii) a model of 
clustered seismicity, with aftershock zone scaling explicitly built in, is able reproduce the apparent power law, 
indicating that an apparent lack of scales in the data does not necessarily contradict aftershock zone scaling 
and the existence of scales associated with mainshock rupture length scales. 



VI.5 The Network Approach 

The recent boom in the statistical mechanics of network analysis has recently extended to applications well 



beyond physics (for reviews, see | Barabdsi and Albert 1999 Albert and Barabdsi 2002 Dorogevtsev and 



Mendes\ [2003| \Newman\ |2003|). E arthquake seismology is no exception [Baiesi and Paczuski] |2004| \Peixoto 
and Prado\ |2004[ \Abe and Suz uki 2004a b ; Baiesi and Paczuski 2003] \Abe andSu zuki 2005 1 B aiesi\\2006 



Abe and Suzuki. 200_6 Peixoto and Prado 2006). The resulting impact has been limited so far for several 
reasons. 

A major concern is the assumption that earthquake catalogs as downloaded from the web are data sets 
fit for immediate analysis. Kagan [ 2003] ; Helmstetter et al. [ 2005 1; Werner [2007 1 and Werner and Sornette 



[2008| present modern and complementary assessments of the many issues of incompleteness spoiling even 
the best catalogs. In particular, we should stress that magnitude determinations are surprisingly inaccurate, 
leading to large errors in seismic rate estimates [Werner and Sornette 2008 1. Furthermore, there is no such 
thing as a complete catalog above a so-called magnitude of completeness, due to the fact that a non-negligible 



fraction of earthquakes are missed in the aftermath of previous earthquakes | Kagan 2003 Helmstetter et al. 



2005 1 . One should be concerned that analyses in terms of network metrics could be particularly sensitive to 



these defects. Nevertheless, §Abe a nd Suzuki 2004a b 2005 1 [2006 1 applied metrics of network analysis to 
"raw" catalogs which included events well below the estimated magnitude of completeness. As a result of 
neglecting to use a (reasonably) homogeneous and trustworthy data set, the results of their analysis may be 
severely biased, because the reliability of the inferred network structure is likely more sensitive than other 
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metrics to the correct spatio-temporal ordering of the earthquake catalog. No serious study has yet been 
performed to quantify the usually serious impact of quality issues on the metrics used in network analysis. 
As a consequence, it is also not clear how to interpret the "success" of \Peixoto and Prado 2004 , 2006 1 in 
reproducing the "features" of Abe and Suzuki's analysis on the synthetic seismicity generated by a spring- 
block model. 

In addition, at best limited attempts have been made to interpret the results of the new network metrics 
using well-known, established facts in seismology. Many of the claimed novel features are probably very 
well understood - they are mostly related to scale-invariance and clustering of seismicity, facts documented 
for decades. The authors should always strive to show that the new metrics that they propose give results 
that cannot be explained by the standard laws in statistical seismology. In this goal, there are well-defined 
benchmark models that incorporate these laws, that generate synthetic catalogs on which the new metrics can 
be tested and compared. 

A few exceptions are worth mentioning. Motivated by the long-standing and unresolved debate over 
"aftershock" identifcation, Baiesi and Paczuski |2004 2005) and Baiesi 1 2006 1 provided a new metric for 
the correlations between earthquakes based on the space-time-magnitude nearest-neighbor distance between 
earthquakes. The authors compared their results with known statistical laws in seismology and with the 
predictions of the ETAS model, actually confirming both. While no new law has been unearthed here, such 



efforts are valuable to validate known laws and continue to test the possible limits. A recent preprint | Zaliapin 



\et al.\ |2007| extended their study and investigated the theoretical properties of the metric and its ability to 
decluster catalogs (i.e., separate mainshocks from aftershocks). They conclude that aftershocks defined from 
this metric seem to be different from the rest of earthquakes. It will be interesting to see head-to-head 
comparisons with current state-of-the-art probabilistic declustering techniques that are based on empirical 
statistical laws and likelihood estimation \Kagan] [l991\\Zhuang et q£| |2002 2004 1. 



VII Future Directions 

The study of the statistical physics of earthquake remains wide-open with many significant discoveries to 
be made. The promise of a holistic approach - one that emphasizes the interactions between earthquakes 
and faults - is to be able to neglect some of the exceedingly complicated micro-physics when attempting 
to understand the large scale patterns of seismicity. The marriage between this conceptual approach, based 
on the successes of statistical physics, and seismology thus remains a highly important domain of research. 
In particular, statistical seismology needs to evolve into a genuine physically-based statistical physics of 
earthquakes. 

The question of renormalizability of models of earthquake occurrence and the role of small earthquakes 
in the organization of seismicity is likely to remain an important topic. It connects with the problem of fore- 
shocks and the predictability of large events from small ones and therefore has real and immediate practical 
applications as well as physical implications. 

More detailed and rigorous empirical studies of the frequency-size statistics of earthquake seismic mo- 
ments and how they relate to seismo-tectonic conditions are needed in order to help settle the controversy 
over the power-law versus the characteristic event regime, and the role of regime-switching and universality. 

Spatially extended, dynamically evolving fault networks and their role in the generation of earthquakes 
are mostly ignored in the statistical physics approach to seismicity. Akin to the filaments in turbulence, these 
may provide key insights into the spatio-temporal organization of earthquakes. Novel methods combining 
information from seismology to faulting will be required (e.g., \Sornette et al. 1990a|b| Sornette |1991 



Gorshkov et al. 2003 Ouillon et al. 2008 1) to build a real understanding of the self-organization of the 
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chicken-and-egg structures that earthquakes-faults constitute. Furthermore, a true physical approach requires 
understanding the spatio-temporal evolution of stresses, their role in earthquake nucleation via thermally 
activated processes, in the rupture propagation and in the physics of arrest, both involved in the generation of 
complex stress fields. 

The important debate regarding statistical physics approaches to seismicity would benefit significantly 
from two points. Firstly, earthquake catalogs contain data uncertainties, biases and subtle incompleteness 
issues. Investigating their influence on the results of data analyses inspired by statistical physics increases the 
relevance of the results. Secondly, the authors should make links with the literature on statistical seismology 
which deals with similar questions. It is their task to show that the new metrics that they propose give 
results that cannot be explained by the standard laws in statistical seismology. For this, there are well-defined 
benchmark models that incorporate these laws, that generate synthetic catalogs on which the new metrics can 
be tested. 
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